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TITLE OF THE INVENTION 
MULTIPLE CONTROLS FOR MOLECULAR GENETIC ANALYSES 

5 

CROSS REFERENCE TO RELATED APPLICATIONS 
This application claims the priority of U.S. Provisional 
Application No. 60/303,825 filed July 9, 2001 entitled, MULTIPLI 
CONTROLS FOR MOLECULAR GENETIC ANALYSES , the whole of which is 
10 hereby incorporated by reference herein. 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR 

DEVELOPMENT 
N/A 

15 

BACKGROUND OF THE INVENTION 
Molecular genetic testing is becoming ever more important 
in prenatal diagnosis, maternal and newborn screening, screening 
for genetic disease in symptomatic and at-risk patients, 

20 identity and paternity testing, characterizing disease-causinc 
organisms contracted from others or released by terrorists, 
characterizing recombinant genes in food, confirming the 
pedigrees of animals or plants, and identifying criminals. Tc 
conduct robust testing, reliable clinical laboratories are 

25 required by the College of American Pathologists to use normal 
and affected haplotype controls each time mutations oi 
polymorphic sites are tested. A positive control is defined as 
extracted total DNA from an individual carrying an affected oz 
polymorphic sequence. This DNA can be amplified and usee 

3 0 directly as a test control or, in the case of mammals, cells can 
be transformed from individuals identified and sampled with the 
desired haplotypes so that an expanded supply of control DNA is 
available. However, a new test can be developed more rapidly 
than control nucleic acid samples from patients with abnormal 
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allelic sequences can be obtained, as can be illustrated by the 
example of cystic fibrosis mutation testing in humans . 

Cystic fibrosis gene testing began when mutations in the 
cystic fibrosis transmembrane receptor gene were reported to 
5 result in cystic fibrosis (Riordan et al . , 1989; US Patent Nos . 
5,776,677, 6,2 01,107, and 5,407,796). Over the next several 
years clinical cystic fibrosis testing for multiple mutations 
was offered by dozens of molecular genetics laboratories. These 
multiple PCR tests included amplifying fragments in which normal 

10 sequences could be distinguished from mutant sequences by 
differences in restriction enzyme sites, differences in the 
length of amplified products, or dot blot analysis with 
mutation-specific oligonucleotide probes. Controls were used 
that had been collected from patients with these individual 

15 mutations. Abnormal alleles from a single patient served as a 
control for each mutation specific test. 

The need to acquire all required patient mutations 
sequences has become more immediate for cystic fibrosis testing 
even as other robust clinical tests using different methods have 

20 been under development. Currently, more than 1000 cystic 
fibrosis mutations and more than 50 polymorphic sites have been 
reported in the cystic fibrosis transmembrane receptor (cftr) 
gene. Twenty-five of these have been reported to occur at a 
frequency exceeding 0.1% (Grody et al . , 2001). These 25 

25 mutations have been designated to be included in testing panels 
that would meet the standard of care for screening pregnant 
Caucasian mothers (Grody et al . , 2001). However, at this time, 
only 21 of these mutant alleles have been collected in cells 
that were transformed and are available through a central cell 

3 0 repository for purchase as positive controls. The remaining four 
mutant alleles are currently unavailable. Testing 25 mutations 
with only 21 mutant control sequences without simultaneously 
testing known mutant sequences from the remaining four sites 
decreases test reliability. A more reliable way of obtaining 
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robust positive controls for all kinds of genetic tests is 
clearly desirable. 

5 SUMMARY OF THE IWENTION 

The invention is directed to methods for genetic testing 
and to compositions comprising nucleic acid sequences for use as 
positive controls in any of the genetic testing methods of the 
invention. Compositions constructed according to the inventior 

10 including multiple nucleic acid sequences are the optimal 
controls for simultaneously testing multiple variable nucleic 
acid sequences (i.e., mutations or polymorphisms) at one or more 
DNA or RNA sites in a test subject. Setting up a robust DNA test 
for multiple alleles and/or genomic sites requires the use of 

15 multiple controls that can be tested simultaneously to confim 
that the assay conditions have been maintained in order tc 
differentiate reliably among all tested alleles. With the use 
of compositions according to the invention, such controls can be 
synthesized and no longer need to be obtained from individuals 

2 0 with those sequences. 

Oligonucleotide primers comprising selected polynucleotide 
sequences in the composition of the invention can be synthesized 
chemically and then extended by polymerase following 
hybridization to a longer complementary sequence within a cloned 

2 5 or extracted total genomic DNA sample. Extended primers are 

amplified by PCR or following cloning into selectable vectors 
that are introduced into a host such as bacteria or yeast fox 
propagation. The amplified fragments are then purified, dilutee 
to the correct concentration, and used as positive controls in a 

3 0 method according to the invention. Alternatively, the normal 

sequence to be changed to the mutated or polymorphic form can be 
cloned and then modified by site directed mutagenesis. Several 
single mutant or polymorphic sequences that together comprise a 
composition according to the invention, i.e. a panel of multiple 
3 5 control sequences, can be added individually to single site 
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tests or mixed together or ligated together by further PCR or b] 
cloning into vectors prior to use in individual or multiple: 
tests. 

Thus, compositions of multiple controls can be used tc 
5 minimize the number of control reactions in multiplex assays 
testing multiple sites or to simplify selection and dispensing 
of control aliquots in assays testing mutations or polymorphic 
sites individually. Controls according to the invention can be 
used when testing any genetically transmitted nucleic aci< 

10 sequence. Tested nucleic acid can be derived from DNA and RNi 
viruses, nuclear DNA from single cellular and multicellula] 
organisms, and inherited DNA in cellular organelles. Control 
compositions according to the invention can be used in tests 
such as determining the identity or classification of ai 

15 individual derived from among any categorized organism, 
determining the genetic relationship of individuals to others 
within the same pedigree; screening populations for the 
prevalence of variants, mutations or other traits deemed wortl 
testing; characterizing genetically transmitted diseases; anc 

2 0 identifying and classifying strains of infectious diseases, 
e.g., contracted by accident or following intentional release, 
Controls according to the invention can also be used bv 
organizations testing quality assurance and by companies 
maintaining quality control of manufactured genetic test kits. 

2 5 The method according to the invention includes identifying 

one or more nucleic acid loci of the subject or subjects to be 
examined by the test to be conducted, identifying important 
multiple mutations or polymorphisms associated with the one 03 
more selected loci, and planning and conducting the test, usinc 

3 0 as a positive control one or more nucleic acid fragments 

comprising three or more mutations or polymorphisms associatec 
with the one or more selected loci. These control nucleic acic 
fragments can be used individually in separate, multiple tests 
or mixed together, or ligated together, for amplification by PCI 
3 5 or DNA cloning prior to simultaneous use in individual 02 



WO 03/078570 



PCT/US02/21506 



multiplex tests. Positive controls synthesized according to th 
invention can be used when testing any genetically transmitte 
(heritable) nucleic acid sequence from any organism. 

Kits are also provided to practice the present inventio 
5 conveniently. These kits contain multiple allelic sequences a 
described herein in a single tube that can be aportioned an 
tested as controls within multiple single allelic tests or one 
as a single control for multiple allelic sites or alleles withi 
a multiplex test. The kits contain, e.g., a vial with UN 
10 sequences including two or more mutations or polymorph! 
controls that span the target locus or loci. The kits may als 
include a thermostable polymerase, other amplification reagent 
or restriction enzymes. 

To aid in understanding the invention, several terms ar 
15 defined below: 

"PGR amplification reaction mixture" refers to an aqueou 
solution comprising the various reagents used to amplify 
target nucleic acid. These include: enzymes, aqueous buffers 
salts, target nucleic acid, and deoxyribonucleosid 

2 0 triphosphates. Depending upon the context, the mixture can b 

either a complete or incomplete amplification reaction mixtur 
and the primers may be a single pair or nested primer pairs. 

"PCR amplification reagents" refer to the various buffers 
enzymes, primers, deoxyribonucleoside triphosphates (bot 
25 conventional and unconventional) , and primers used to per for 
the selected amplification procedure. 

"Amplifying" or "Amplification, " which typically refers t 
an "exponential" increase in target nucleic acid, is being use 
herein to describe both linear and exponential increases in th 

3 0 numbers of a select target sequence of nucleic acid. 

"Bind(s) substantially" refers to complementar 

hybridization between an oligonucleotide and a target sequenc 
and embraces minor mismatches that can be accommodated b 
reducing the stringency of the hybridization medium to achiev 
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the desired priming for the PCR polymerases or detection o 
hybridization signal . 

"Hybridizing" refers to the binding of two single strande 
nucleic acids via complementary base pairing. 

"Nucleic acid" refers to a deoxyribonucleotide o 
ribonucleotide polymer in either single- or double-strande 
form, that unless otherwise limited also encompass known analog 
of natural nucleotides that can function in a similar manner a 
naturally occurring nucleotides . 

"Nucleotide polymerases" refers to enzymes able to catalyz< 
the synthesis of DNA or RNA from a template strand usin< 
nucleoside triphosphate precursors. In the amplif icatioi 
reactions of this invention, the polymerases are template' 
dependent and typically add nucleotides to the 3 ■ -end of th< 
polymer being synthesized. It is most preferred that the DNi 
polymerase is thermostable for PCR amplification as described i] 
U.S. Pat. No. 4,889,819, incorporated herein by reference. 

The term "oligonucleotide" refers to a molecule comprise* 
of two or more deoxyribonucleotides or ribonucleotides, 
including primers, probes, nucleic acid fragments to be 
detected, and nucleic acid controls. The exact size of ai 
oligonucleotide depends on many factors including its ultimate 
function or use. Oligonucleotides can be prepared by an} 
suitable method including cloning and restriction enzyme 
digestion of appropriate sequences or direct chemical synthesis 
by a method such as the phosphotries ter method of Narang et al . , 
1979, Meth. Enzymol . 68:90-99; the phosphodiester method oi 
Brown et al . , 1979, Meth. Enzymol. 68:109-151; the 
diethylphosphoramidite method of Beaucage et al . , 1981, 
Tetrahedron Lett. 22:1859-1862; and the solid support method of 
U.S. Pat. No. 4,458,066, each of which is incorporated herein b^ 
reference . 

The term "primer" refers to an oligonucleotide, whether 
natural or synthetic, capable of acting as a point of initiation 
of DNA synthesis under conditions in which synthesis of a primer 
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extension product homologous to a nucleic acid' strand i 
induced, i.e., in the presence of four different nucleosid 
triphosphates and an agent for polymerization (i.e. polymeras 
or reverse transcriptase) in an appropriate buffer and at 
5 suitable temperature. A primer is preferably a single-strande 
oligodeoxyribonucleotide . The appropriate length of a prime 
depends upon its intended use but typically ranges from 15 to 7 
nucleotides. Shorter primer molecules generally require coole 
temperatures to form sufficiently stable hybrid complexes wit 
10 the template. A primer need not reflect the exact sequence o 
the template but must be sufficiently complementary to hybridiz 
to a template. 

The term "primer" may refer to more than one primer. Fo 
instance, if a region shows significant levels of polymorphis: 
15 or mutation in a population, mixtures of primers can be prepare 
that will amplify alternate sequences. A primer can be labeled 
if desired, by incorporating a label detectable b; 
spectroscopic, photochemical, biochemical, immunochemical, o: 
chemical means. Useful labels include p32, fluorescent dyes 

2 0 electron-dense reagents, enzymes (as commonly used in an ELISA) 

biotin, or haptens and proteins for which secondary labele< 
antisera or monoclonal antibodies are available . A label cai 
also be used to "capture" .the primer, so as to facilitate th< 
immobilization of either the primer or a primer extension 
25 product, such as amplified DNA on a solid support. 

The term "reverse transcriptase" refers to an enzyme thai 
catalyses the polymerization of deoxynucleoside triphosphates tc 
form primer extension products that are complementary to c 
ribonucleic acid template. The enzyme initiates synthesis at the 

3 0 3 1 -end of the primer and proceeds toward the 5 1 -end of th< 

template until synthesis terminates . Examples of suitable 
polymerizing agents that convert the RNA target sequence into i 
complementary, DNA (cDNA) sequence are avian myeloblastosis 
virus reverse transcriptase and Thermus thermophilus DNi 
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polymerase, a thermostable DNA polymerase with revers 
transcriptase activity marketed by Perkin Elmer Cetus, Inc. 

As used herein, the term "sample" refers to a collection o 
biological material from an organism containing nucleated cells 
5 This biological material may be solid tissue as from a fresh o 
preserved organ or tissue sample or biopsy; blood or any bloo 
constituents; bodily fluids such as amniotic fluid, peritonea 
fluid, or interstitial fluid; cells from any time in gestatio 
including an unfertilized ovum or fertilized embryo 

10 preimplantation blastocysts, or any other sample with intac 
interphase nuclei or metaphase cells no matter what ploidy (ho' 
many chromosomes are present) . The "sample" may contai: 
compounds which are not naturally intermixed with the biologica 
material such as preservatives anticoagulants, buffers 

15 fixatives, nutrients, antibiotics, or the like. 

The terms "allele-specif ic oligonucleotide" and "ASO 
refers to oligonucleotides that have a sequence, called 
"hybridizing region, " exactly complementary to the sequence t< 
be detected, typically sequences characteristic of a particula: 

2 0 allele or variant, which under "sequence-specific, stringen- 

hybridization conditions" will hybridize only to that exac 
complementary target sequence. Relaxing the stringency of th< 
hybridizing conditions will allow sequence mismatches to be 
tolerated; the degree of mismatch tolerated can be controlled b] 
25 suitable adjustment of the hybridization conditions. Depending 
on the sequences being analyzed, one or more allele-specif ic 
oligonucleotides may be employed. The terms "probe" and "ASC 
probe" are used interchangeably with ASO. 

A "sequence specific to" a particular target nucleic acic 

3 0 sequence is a sequence unique to the isolate, that is, not 

shared by other previously characterized isolates. A probe 
containing a subsequence complementary to a sequence specific tc 
a target nucleic acid sequence will typically not hybridize tc 
the corresponding portion of the genome of other isolates undei 
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stringent conditions (e.g., washing the solid support in 2XSSC 
0.1% SDS at 70 °C. ) . 

"Subsequence" refers to a sequence of nucleic acids tha 
comprise a part of a longer sequence of nucleic acids . 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 
Other features and advantages of the invention will b 
apparent from the following description of the preferre 
10 embodiments thereof and from the claims, taken in conjunctio: 
with the accompanying drawings, in which: 

Fig. 1A shows the preparation of PGR amplif icatioj 
constructs with two different immediately adjacent allelii 
sequences spanned by a single chemically synthesize* 
15 polynucleotide that is used to prepare a longer genomic sequenc< 
for use as two positive control sequences in the method of th< 
invention; 

Fig. IB shows PGR amplification to construct two or mor< 
site-specific allelic sequences within a PGR amplifiabl* 
20 sequence length of up to 20, 000 basepairs for use as tw< 
positive control sequences in the method of the invention; 

Fig. 1C shows splicing in any orientation of two PCI 
amplified fragments each with one or more allele-specif ic sites 
which may include restriction enzyme site(s) for use as two 02 
25 more positive control sequences in the method of the invention; 

Fig. ID shows complementary ligation sequences such as but 
not limited to restriction enzyme site(s) added to the 5' prime 
terminus to facilitate subsequent unidirectional ligation foi 
preparation of two or more positive control sequences according 
3 0 to the invention; 

Fig. IE shows the preparation of homozygous controls at twc 
allelic sites for use as two positive controls in the method of 
the invention that provide sufficient flanking sequences in the 
5 ' and 3 ' direction as required by the assay method and the 
3 5 length of the tested unknown genomic sequences; 
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Fig. IF shows the preparation of heterozygous controls c 
two allelic sites for use as positive control sequences in tt 
method of the invention ; 

Fig. 1G shows adding an additional homozygous mutatic 
5 adjacent to two existing mutations synthesized as in Fig. IE fc 
use as three positive control sequences in the method of tt 
invention; 

Fig. 2 shows site directed mutagenesis with a single prime 
for preparation of one or more positive control sequences to fc 
10 selected and used according to this invention; 

Fig. 3 shows the preparation of four heterozygous positiv 
control sequences according to this invention for use in cysti 
fibrosis testing ; 

Fig. 4 shows mass spectrometry time of flight analysis o 
15 positive control sequences prepared as shown in Fig. IF; and 

Fig. 5 shows a cloning strategy for joining together 1 
fragments that each contains one or more unique allelic site 
for use as positive control sequences in the method of th 
invention . 

20 

DETAILED DESCRIPTION OF THE INVENTION 
Described herein are methods to optimally construct nuclei 
acid sequences for use as multiple positive controls in an 
desired genetic test format in which normal sequences ar 

2 5 distinguished from mutant sequences and/ or polymorphic sequence 

are distinguished from each other, for example, by determinin 
differences in restriction enzyme sites, differences in th 
length of amplified products, or dot blot analysis wit: 
mutation-specific oligonucleotide probes. Other uses include a 

3 0 quality control standards in the manufacture of genetic tests 

or as testing standards by regulatory agencies maintainin 
quality assurance. Compositions according to the invention 
including multiple nucleic acid sequences constructed a, 
described, are the optimal controls for simultaneously testing 
3 5 multiple variable nucleic acid sequences at one or more DN, 

10 
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and/or RNA sites. Setting up a robust DNA test for multipl 
alleles and/or genomic sites requires the use of multipl 
controls that are tested simultaneously to confirm that tt 
assay conditions have been maintained so as to differentiat 
5 reliably among all tested alleles. With the use c 

compositions according to the invention, such controls can t 
synthesized and no longer need to be obtained individually. 

Sequences according to the invention can be prepare 
chemically and/or by PCR amplification for use directly or afte 

10 cloning and propagation. At the same time, some sequences can fc 
PCR amplified and/or cloned directly from total genomic DN 
obtained from an individual carrying the mutation or variant 
Alternatively, the normal sequence to be changed can be clone 
and then modified by site directed mutagenesis. Several singl 

15 mutant or polymorphic sequences that together comprise a pane 
of multiple control sequences can be added individually t 
single site tests or mixed together or ligated together b 
further PCR or by cloning into vectors prior to use i 
individual or multiplex tests . 

2 0 According to the method of this invention, in on 

embodiment any prepared DNA sequence with two or more abnorma 
or polymorphic sites associated with a desired genetic test ca: 
be used as a single control segment that is teste 
simultaneously with unknown samples . These can be representee 
25 by two or more adjacent sites within or between loci withou 
regard to the actual location of these loci in the genome teste* 
so long as the control sequence (s) spans the entire unknow 
sequences tested. For instance, two adjacent sites may be (1 
within the same PCR primer site, (2) within any PCR amplif iabl< 

3 0 length, (3) within any cloned DNA sequence, (4) within an: 

adjacent exons that were spliced and cloned from cDNA, (5 
within the same exon, (6) in an exon and/or an adjacent intron, 
(7) in a gene promoter or enhancer region, (8) in one or more 
flanking gene regions, (9) at existing or new splice sites, (10; 
3 5 in any other two closely spaced adjacent sites, (11) in c 
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selected cytogenetically or pheno typically important chromoson 
region ( s) , or (12) at sites comprising any combination of thes 
categories. Sites may also be found on different chromosomes. 

Adjacent regions of the same gene that are functional: 
5 significant to the DNA assay for the genetic test can be splice 
together in any order to serve as multiple controls so long e 
the individual length of each spliced sequence is sufficient t 
serve as a robust control for the test applied. These clone 
control segments that represent partial mutant gene sequence 

10 will have an analytical function that is derived from 

previously characterized altered biological function, but th 
sequences in different fragments that are combined subsequentl 
will be different spacially. Mutations may be found in an 
functional portion of a gene including exons , splice sites, an 

15 promoters as well as flanking gene regions. Variable di-, tri 
or tetranucleotide repeat alleles may modify gene expression o 
function if the repeat number exceeds a disease-specif i 
threshold or merely be used as very informative markers fo 
identity or gene linkage studies. Frameshift mutations an 

2 0 destruction or insertion of a terminator codon also have a ma jo 
impact on gene function. 

In contrast, analytical function is defined by (1) th 
critical sequence length required for reliable testing, (2) 
readily renewable source of a DNA sequence that can be prepare 

2 5 with high fidelity and validated before use that (3) can be use 

as a standard within and between laboratories to optimize tes 
delivery. Other functionally useful analytical characteristic; 
may include the most common mutations that represent the highes 
proportion of gene abnormalities in a patient population and th 

3 0 number and frequency of the remaining mutations o: 

polymorphisms , 

As will be described more fully in the Examples, Figs, 1A 
IB, and IE demonstrate how to incorporate mutant sequence! 
within a PCR amplifiable sequence length into single DNi 
3 5 fragments with or without normal flanking sequences b] 
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synthesizing site-specific PCR primers. Figs. 1C and 1 

illustrate how individually synthesized primers can be splice 
together. Fig. ID further illustrates a ready means to direc 
splicing fragments in a preselected order and orientation. Fig 
5 IF illustrates how to synthesize heterozygous controls with bot 
alleles in equal quantities within the same PCR amplificatio 
reaction. Fig. 1G teaches how to add one (18'; or more) contro 
sequence adjacent to two existing control sequences (14' an 
15'). The only limitation is that the additional modified DN 

10 sequence (18') being introduced cannot be sufficiently close t 
another allelic sequence being tested (14' or 15 ' ) to interfer 
with the robust analysis of the preexisting control site. 

Although some genes are very large , like the huma 
dystrophin gene which spans >3, 0 00, 0 00 basepairs, only th 

15 length of functionally important gene modifications that can b 
measured readily by the test method are required to be splice 
and placed together as a control. The limit of the lengths tha 
can be spliced together and grown in large quantities in som 
living organism is defined by the capacity of the vector int 

2 0 which the fragments have been cloned. 

Cloning strategies vary according to the vector's clonin 
site(s), the selectable gene markers at these cloning sites, an 
the capacity of the vector (Sambrook, Fritsch, & Maniatis 
1989) . One efficient strategy for combining many differen 

2 5 synthesized fragments would be to clone these into plasmids and 

if need be, into phage or cosmids . The multiple cloning site 
introduced into commercially available plasmid vectors lik 
pUC19 have multiple restriction enzyme sites immediatel; 
adjacent to each other to simplify cloning strategy design.' 

3 0 These constructed vectors carrying multiple control site; 

would then be grown and isolated in an appropriate concentratio: 
to add to control tubes for PCR amplification adjacent to tube! 
with unknown total genomic DNA. The number of control DN, 
sequences per unit volume would be similar to total genomi< 
3 5 target sequences in the unknown tubes. 

13 
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Alternatively, a series of multiplexed PCR primers can t 
used to PCR amplify one, more, or all of the segments that ai 
functionally significant to the test. It must be noted tha 
continued PCR amplification of heterozygous stock PCR produc 
5 might ultimately amplify one allele much more than the othe 
even though the amplification efficiency differs by less tha 
3%. The method in Fig. IF is designed to avoid this possibilit 
and has been effective for all controls made to date 
Nevertheless, this potential problem may be avoided further b 

10 making homozygous PCR products or cloning both fragment 
amplified at each site for continued propagation in bacteria. 

Yet another means to construct controls with multipl 
variant DNA sequences is to first clone or obtain a clone of th 
genomic region to be tested. Then, any mutation, polymorphism 

15 or variant sequence desired can be introduced into the clone 
segment by site directed mutagenesis, as shown in Fig. 2, or b 
the method described to introduce the 18' sequence in Fig. 1G 
Using site directed mutagenesis, the mutagenic primer may hav 
one or more mutant or polymorphic sites. One, two, or mor 

2 0 mutagenic primers may be used simultaneously according to th 
method provided with the kit which is incorporated in it 
entirety by reference (Transformer Site-Directed Mutagenesi 
Kit, Clonetech Laboratories, Palo Alto, CA) . 

As more diseases need to be tested, more mutant sequence 

2 5 can be synthesized and added to the same DNA sample or to a: 

additional tube with other multiple controls including tube 
with very closely spaced DNA sequence changes that woul< 
interfere with each other's reliable analysis if both wer 
tested together on the same DNA fragment. For instance, i: 

3 0 addition to the cystic fibrosis gene, mutations can also b< 

constructed for all designated mutations for other small am 
large genes that when mutated result in diseases like Ret 
Syndrome. Other tests include: (1) multiple genes that whe] 
mutated each result in the same genetic disease phenotype (lik< 
35 Charcot-Marie-Tooth Disease), (2) groups of genetic disease: 
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commonly found in the same ethnic population (like Tay-Sachs 
disease, Gauchier disease, and Canavan disease in individuals of 
Ashkenazi Jewish descent and (3) the most common genetic 
diseases found in the general population (like Duchenne muscular 
5 dystrophy) . Human screening alone encompasses testing women 
before and after conception as well as testing individuals 
during all stages of the life cycle from preimplantation embryos 
to geriatric patients. The same DNA construct carrying multiple 
abnormal and/or normal control sequences can be used to optimize 

10 sample testing and simplify assay design, set-up, and quality 
control for (1) multiple individual assays, (2) assays that test 
multiple locations simultaneously by unique length products 
measured with internal size controls as in gel electrophoresis 
or mass spectrometry, (3) multiplex assays that test several 

15 locations simultaneously as in testing multiple polymorphic loci 
with different colored products or on multiple microchip 
locations or (5) testing the absence, decreased, or increased 
quantity of a product. 

Multiple DNA control segments that have been verified by 

2 0 sequencing can be used as a common standard by all testing 

laboratories to readily compare reliability within and between 
protocols and to optimize the interpretation of (1) mutation 
tests for disease genes like cystic fibrosis, (2) critical 
simple sequence repeat lengths like the fragments containing 
25 trinucleotide repeat numbers (i.e., 34 or 39 in Huntington 
disease), and (3) SNP (single nucleotide polymorphic) alleles 
also known as simple sequence repeat (SSR) alleles during 
identity testing, linkage analysis, or forensic testing. 
Constructing and using DNA fragments with multiple genetic 

3 0 mutations and polymorphisms as control sequences provides a 

ready means to determine whether all laboratories using the same 
protocols obtain the same results. For any one genetic disease, 
all mutations and polymorphic sites can be synthesized and 
tested. Alternatively, a subset of mutant, polymorphic and 
35 variant sites can be selected to identify a substantial 
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proportion of mutations and/or the segregation thereof for any- 
one genetic disease including the most common severe mutations, 
as has been recommended for cystic fibrosis. The mutations 
screened may be found in a group of diseases that are 
5 conveniently tested simultaneously because these genes are 
related technically, functionally, or are the most common in a 
selected target population. This embodiment is applicable to 
population screening as well as individual organism 
identification, classification, and infectious or genetic 

10 disease screening. Tested organisms include all single cellular 
and multicellular organisms at any stage of their life cycle. 

Controls can be designed and synthesized to include all the 
optimal sequences required for mutation and/or polymorphic 
analyses at any locus including gene introns, exons, or flanking 

15 regions and polymorphic sites. Appropriate length artificially 
synthesized nucleic acid sequences from any organism can be 
designed to accommodate the specific testing format. Optimal 
controls can be synthesized with only abnormal sequences, i.e., 
homozygous controls, like those mutations found in both genes of 

2 0 individuals affected with autosomal recessive disease (Fig. 1G) . 
Alternatively, controls can be synthesized with both a normal 
and an abnormal sequence (Fig. IF), i.e., heterologous controls, 
like the mutations found in carriers of autosomal recessive 
disease and in different genes of individuals affected with an 

2 5 autosomal dominant disease. 

Segments from more than one gene can be incorporated into 
the same control so that a single DNA segment can be used as the 
only control required in a multiple gene assay. These 
additional genes may be selected according to any useful 

3 0 criterion. Appropriate groups include genes in the same 

metabolic pathway that each may result in accumulation of a 
metabolite found in excess in the subject, or one of a family of 
genes that each contributes to a biologically active complex. 

The methods described herein can be used for any DNA assay 
3 5 regardless of the organism from which the DNA is derived. Any 
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organism that has ever been characterized and classified is 
included, from single to multicellular organisms, plants, and 
animals as well as their subcellular compartments with heritable 
DNA sequences, like mitochondria. 

5 

The following examples are presented to illustrate the 
advantages of the present invention and to assist one of 
ordinary skill in making and using the same. These examples are 
10 not intended in any way otherwise to limit the scope of the 
disclosure . 

EXAMPLE I 

PGR Amplification Constructs 
15 Immediately Adjacent Allelic Sequences 

As shown in Fig. 1A, the sequences of two or more site- 
specific allelic nucleotide changes (1' and 2') can be 
synthesized directly into a single stranded DNA fragment that 
serves as an homologous PCR primer along with an homologous 

20 primer to the opposite strand spanning the downstream site. 

Even primers exceeding 60 basepairs long spanning more than one 
allelic site have served as unique PCR primers. The selected 
PCR primer pair can be used to amplify unique double stranded 
DNA products of a primer-defined length including normal and 1' 

25 and 2' nucleotide sequences using total DNA as an amplification 
template. Site 3 may represent one or more additional common 
allelic sequence (s) for which a different mutant allele is being 
prepared in a different segment. After multiple PCR 
amplification cycles, >99% of double stranded amplified PCR 

3 0 products of the correct length will include allelic sites l 1 , 2' 
and 3 . (Any single site designation in this entire figure may 
also represent two or more variant locations . ) 

EXAMPLE XI 

17 
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PCR Amplification Constructs Two or More Site-Specific Alleles 
within a PCR Amplif iable Sequence Length 

Referring to Fig. IB, each of two individual primers of a 

complementary pair (4' and 5') can be synthesized to include one 
5 or more allelic sequences selected as a control. After multiple 
amplification cycles, >99% of the unique length double stranded 
fragments synthesized from this PCR primer pair will have both 
site-specific alleles. Any single primer may include two or more 
allelic nucleotide sequences within the single primer sequence. 

10 The minimum distance between two useful different allelic 
sequences in the same primer is determined by the DNA test 
method applied and the length and percentage of sequence 
homology in the DNA sequences that hybridize to the PCR product 
which together determine the melting temperature (Tm) required 

15 to denature hybridized double stranded DNA fragments. The 
specificity of hybridization and denaturation determines the 
reliability of the selected test method to distinguish between 
two different sequences in the subsequent assay . 

20 

EXAMPLE III 

Splicing Together Two PCR Amplified Fragments 
Each with One or More Allele- specif ic Sites 

Referring to Fig. 1C, fragments with two different allele- 

2 5 specific sites are ligated together to form a single unit that 

can serve as a control for both alleles . Nonspecific ligation 

like blunt ended ligation of two different fragments produces 

any one of four products. Any single site designation may 

represent two or more allele-specif ic sequences. 

30 

EXAMPLE IV 

Complementary Ligation Sequences Added to the 5' Primer Terminus 
Facilitate Subsequent Unidirectional Ligation 

35 As shown in Fig. ID, primers that are constructed to have 

overlapping joining segments (illustrated with 3 vertical bars) 

18 
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in the first 6' amplification mix can prime another PCR 
amplified fragment with a complementary segment on one of the 
other primers used in the 7' amplification mix (Fig.. ID, top). 
Then the products can be digested with the same restriction 
5 enzyme selected so that digestion only occurs in the joining 
segments and digested fragments have single stranded 
complementary ends. Subsequently these complementary ends can 
be annealed and ligated unidirectionally to give the first 
product illustrated in Fig. ID. Alternatively, given sufficient 

10 complementary sequence overlap at the 3 ' downstream location of 
the left fragment and the 5' upstream location of the right 
fragment, these two products (Fig. ID, top) can be denatured, 
used to prime each other at location 7', and processed with 
polymerase to complete the synthesis of unique length double 

15 stranded longer products that have both the 6' and 7 7 allelic 
sequences ligated unidirectionally (Fig. ID, bottom product) . 
The number of different fragments that can be ligated to a 
unique location in any construct by this protocol is dependent 
upon the number of sequences that are required to assure unique 

20 non-homologous binding to one other fragment. This protocol 
provides sufficient capability to join any two fragments 
uniquely in any construct that can be envisioned. Site-specific 
joining sequences may be synthesized on the end of each primer 
so that multiple segments can be ligated and the final construct 

2 5 readily ligated into a vector for growth of commercial 

quantities . 

EXAMPLE V 

Homozygous Controls at Two Allelic Sites 

3 0 Homozygous controls sufficiently close to the end of the 

required length of a PCR amplified product to be incorporated 
into an artificially synthesized PCR primer can be used to PCR 
amplify a homozygous control as in Fig. IB. In contrast, when 
one or more homozygous control site(s) is required to be in the 
3 5 center of a longer length fragment, more than one approach can 

19 
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be used to synthesize this control sequence. (See, for example, 
Fig. IE for the synthesis of a homozygous control and Fig. 1G 
for the synthesis of a heterozygous control) . Fig. IE shows 
that two complementary primers that include the variant sequence 
5 can be constructed that will subsequently bind to each other and 
serve as primers for PGR synthesis of one larger fragment with 
longer flanking sequences on both sides. The forward primer PL 4 
(Primer Left 4) along with the reverse primer with sites 14' and 
15 ' amplifies the 5' double stranded DNA region in Reaction A. 

10 In a separate reaction tube the forward primer with sites 14' 
and 15' along with the reverse primer PR4 (Primer Right 4) 
amplify the 3 7 double stranded DNA region in reaction B. After 
these amplifications are complete, aliquots from, reactions A and 
B are mixed, denatured, annealed, and the polymerase extends the 

15 products from reaction A and B that anneal at the 14' and 15' 
primer sites (Figure IE, IV) . Now, additional primers PL4 and 
PR4 are added and PCR amplification continues, resulting in the 
synthesis of a product of which >99% is homozygous for sites 
14' , 15' , 16 and 17 . 

20 

EXAMPLE VI 

Heterozygous Controls at Two Allelic Sites 

As shown in Fig. IF, primers PL 4 and 12' together amplify a 
homozygous fragment with a variant sequence at site 12 and a 

25 common allelic sequence found in the total genomic DNA at site 
11 (Fig. IF, Reaction D) . PCR primers 11' and PR4 amplify a 
homozygous fragment with a variant allelic sequence at primer 
site 11' and a common allelic sequence in the total genomic DNA 
being amplified at site 12 (Fig. IF, Reaction E) . These PCR 

3 0 reaction products can be synthesized and used independently as 
homozygous controls. These can also be mixed and PCR amplified 
as in Reaction F for use as full length heterozygous controls 
for sites 11' and 12' when sufficient flanking sequence is 
included in both products . 
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Products from Reactions 1 and 2 can also be ligated to each 
other as in Fig. 1C, grown in a vector or amplified again, and 
also used as heterozygous controls. 

When control fragments are required that span ■ the entire 
5 sequence from PL 4 through PR4 that must include 11, 11' , 12, and 
12' sequences, these controls can be synthesized from Reactions 
1 and 2 using additional PL4 and PR4 primers to synthesize a 
mixture of double stranded DNAs that are completely 
complementary in both strands and >99% of which span PL 4 and PR4 
10 (Fig. IF, bottom) . This product mixture is suitable for use as 
a heterozygous control for standard PCR assays with primers Fl 
and R2 followed by restriction enzyme analysis at both sites 11 
and 12 . This method was the first one enabled by PCR 
amplification and included the G542X, 1717-1G->A, S549N, and 
15 R56 0T loci These initial products were verified as heterozygous 
by both restriction enzyme analysis and by mass spectrometry 
(Fig. 3). (Note: Any single site designation may represent two 
or more locations of the same category. Any synthesized primer 
may have an extra segment on its 5' end for subsequent 

2 0 engineering.) 

EXAMPLE VII 

Adding Another Mutation adjacent to Two Existing Mutations 

Referring to Fig. 1G, an existing DNA fragment with 
25 mutations at sites 14 ' and 15' is constructed as in Fig. IE. 
Forward and reverse primers spanning the 18' mutation locus to 
be modified are selected, ordered and synthesized. PCR primers 
18' and PR4 are used to amplify a homozygous fragment with a 
variant allelic sequence at primer site 14' and 15 ' and a common 

3 0 allelic sequence in the total genomic DNA being amplified at 

site 16 (Fig. 1G, Section II) . 

As shown in Fig. 1G, primers PL 4 and 18' together amplify a 
homozygous fragment with a variant sequence at site 18 and a 
common allelic sequence found in the total genomic DNA at site 
35 17 (Fig. 1G, Section III) . Aliquots of both amplified fragments 
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are added to another reaction tube with additional PL 4 and PR4 
primers, and PCR amplification continues until all the amplified 
fragments have mutations 18' , 14' and 15' flanked by normal 
locations 17 and 16. The limit to this method is that a new 
5 mutation cannot interfere with the assay of an adjacent 
mutation. In that case, an additional fragment must be mutated 
for use either in the same or a separate control reaction mix. 
If the additional fragment is used in the same reaction mix, 
then that mix becomes heterozygous for the two mutations that 
10 were synthesized on separate fragments. Alternatively, the two 
PCR reaction products can be synthesized and used independently 
as homozygous controls . Any synthesized primer may have an 
extra segment on its 5' end for subsequent engineering, as in 
Fig. ID. 

15 

EXAMPLE VIII 

PCR Amplification Products of Heterozygous controls 

This more complex example, illustrated in Fig. IF, was 
enabled by constructing a mixture of two products carrying both 
20 the normal and mutant sequences at four CFTR gene sites: 1717- 
1,G->A; G542X; S549N; and R560T. PCR was used to amplify one 310 
bp fragment with two mutations under the F2 primer site (1717-1 
and G542X) . In a separate reaction, a 213 bp fragment carrying 
two mutations in the Rl primer (S549N and R560T) were also 

2 5 amplified. These amplified fragments were confirmed to be the 

correct length as determined by agarose gel electrophoresis of 
the individual products . Then these two amplification products 
were mixed and further amplified with additional Fl and Rl 
primers to give a mixture of two new full length products each 

3 0 425 bp long. Each product has two mutated sites and two normal 

sites. These products both exist in the reaction mix (IV) and 
co-migrate on agarose gel electrophoresis (Fig. 3, Lane IV). 

Specifically, the first primer incorporates both the G542X 
and 1717-1 G->A mutation sequences: 11' (Fl)-AGA CAT CTC CAA GTT 
3 5 TGC AGA GAA AGA CAA TAT AGT TCT TTG AGA AAG . The second primer 
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incorporates the S549N and R5 6 0T mutation sequences in the 
reverse complimentary strand direction: 12'(R1)-ACG TTG CTA AAG 
AAA TTC TTG CTC GTT GAC CTC CAT TCA GTG TGA. The 11 ' (Fl) and 
12' (Rl) PCR primers can be used as a pair to amplify a single 98 
5 bp sequence including all four mutations (Fig. IB) . After 
amplification of a homozygous 9 8 bp product, two additional 
primers (PL4(F2) and PR4(R2)) can be added to the synthesized 
fragment with total DNA to amplify a 425 bp fragment that spans 
the region carrying these four cystic fibrosis mutations: 

10 PL 4 (F2) -GAA GGA AGA TGT GCC TTT CAA and PR4 (R2 ) -AGC AAA TGC TTG 
CTA GAC CA (Fig. IE) . 

Alternatively, Fl and R2 can be used to amplify total DNA 
in one tube and F2 and Rl to amplify DNA in a second tube. This 
will amplify all synthesized products, half that carry the G542X 

15 and 1717-1G->A mutations and the other half that carry the S549N 
and R560T mutations (Fig. IF, Fig. 3) . Then, these sequences 
can be mixed together to amplify fragments with both normal and 
abnormal sequences at the same four loci in the same tube. If 
only the amplified fragments are used as primers to amplify 

2 0 full-length fragments from each other, then the resulting double 
stranded DNA fragments will be different from each other. These 
will serve as controls for ASOs and mass spectrometry. In order 
to use this as a control for restriction enzyme analysis, 
additional PCR amplification with PL4(F2) and PR4(R2) primers is 

2 5 required to produce completely complementary double stranded PCR 

products (Fig. IF, bottom, Fig. 3 -IV) . 

This PCR strategy was then used to synthesize 43 fragments 
carrying 93 mutations (Table 1) , with 1 to 5 mutations included 
in the PCR amplified products at each of the 43 gene sites. 

3 0 Thus, no limitations have been encountered. At the same time, 

mutations in the same gene region disrupt the cftr gene function 
so that multiple mutations have been found in adjacent amino 
acid sites. Thus, among the 43 fragments synthesized, 4 

fragments were synthesized with 5 mutation sites each. A 
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fraction of these 43 PCR amplified fragments have been used as 
mass spectrometry controls on a routine basis. 

5 Mass Spectrometry Time of Flight An.alys.is 

of Heterozygous Control 

Analysis of product PL4-12' fragment (Example VI; Figure 3) 

used to amplify product ll'-PR4 (Figure 4) shows (A) the normal 

sequence and (B the synthesized mutant sequence at one selected 

10 control site along with (C) a longer normal sequence at an 
adjacent different mutation site in the DNA that was not 
modified in the PCR amplified product illustrated in Figre 3. 
All ; of the heterologous controls synthesized in Example VI gave 
two r peaks for each locus. Similar experiments were subsequently 

15 carried out for all mutations of Table 1, with similar results. 
This confirms that, both normal and abnormal sequences are found 
at: both locations. 
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TABIiE Ir 


Selected Examples of Syn 


th.es i zed CFTR Mutations. 




39 4delTT 


405 + lG-->A 


G91R 


405+lG->A 


4 57 TAT -G 


I148T - 


Y122X 


E92X 


574delA 


444delA 


R117H 


■621+1 


G17 8R 


. 711+5G- : >A 


711+1 ,G->T 


712-1, G->T 


Ij2 0f5w, 


G330X 


R374H 


T3 3 8I 


T3 60K* 


1078delT 


R334W 


Q359K 


T3 60K 


R347P 


I336K 


P.3 52Q 


33S4P ■ 


1154insTC 


5T ( delTT ) 


A455E 


Q493X 


DF508 


F520I 


DF507 


G480L 


V520F 


1609delCA 


Q493R 


677delTA 


C524X 


R533X 


H560K 


G551D 


R560T 


1717-1, G->A 


G542X 


Q552X 


A559T 


S549N 


S549I 


S549R 


A561E 


E585X 


1898+5, G->T 


P574H. 


1898+1, G->A 


1812-1,G->A 


Y5 63D 


A4 9 delS 4 


2184delA 


2043delG 


2 14 3 del T 
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2183AA->G 
2176InsC 
2 869lnsG 
W1092X 
37 91delC 
3849+4, A- >G 
D12 7 0N,G->A 
S1251N 



K710X 

2789+5, G->A 

R1066C,C->T 

R1158X 

3659delC 

3821delT 

W1282X 



2 3 07 Ins A2 
Q890X 

W1089X,G->A 
3 662delA 
S1196X 
S1235R,T->G 
4005+l,G->A 



1841ns A 

5945L,C->T 

T1070Q 

Q1238X 

R1162X 

3 849 ,G->A 

W1282R 



10 Note: 

All of these mutations were introduced so that both the normal and 
abnormal sequences were present in the amplified product. While this is the 
form of control that always tests any subsequent PGR to determine that both 
the abnormal and normal target sequences will amplify efficiently, only 
15 homozygous controls are required to meet all College of American Pathologists 
requirements. Therefore substituting only fragments with abnormal sequences 
would simplify the routine use of these multiple synthesized control DNA 
sequences. Whether homozygous, heterozygous or both types of controls are 
preferred depends upon the testing method. 

20 

EXAMPLE X 
Cloning Strategy for 16 fragments 

Cloning provides high fidelity biologically replicated 
sequence as a permanent renewable resource. Thus the controls 

2 5 can be relied upon when testing the reliability of the assay 

system. Aliquots of living organisms carrying the clone can be 
stored indefinitely to replace the growing stock in the event 
that a single basepair change occurs in a subsequent subculture. 
At the same time, the fidelity of this replication is confirmed 

3 0 by every assay for which the cloned sequences are used as a 

control . 

Judicious primer sequence selection, enzyme digestion, 
annealing, cloning, selection, characterization, excision, and 
ligation to other cloned fragments provides a straightforward 
3 5 strategy to clone multiple control sequences into a single 
vector for subsequent propagation and use. To date more than 43 
fragments with 1 to 5 mutations in each fragment have been 
synthesized. Given that the 92 mutations have been synthesized 
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into 43 fragments that average 3 00 basepairs in length, a 
general strategy for cloning fragments with these 
characteristics follows . 

Vectors with multiple cloning sites were developed in order 
5 to insert nearly any DNA fragment in the direction desired and 
to prevent ligation of the vector to itself to minimize 
background clones when searching for clones with inserted DNA. 
Directional cloning with different restriction enzyme sites on 
both sides of the cloning site is particularly useful to splice 

10 adjacent gene sequences together and to place sequences into 
protein expression vectors that need to express the sense 
strand. In contrast, the DNA fragments carrying the mutations 
merely require sufficiently long flanking sequence to be 
recognized by the assay. Unlike the actual gene sequence which 

15 must be intact to remain functional, the order and orientation 
of different synthesized fragments carrying mutations is 
unimportant so long as the sequences flanking each mutation are 
sufficiently long to be tested unambiguously by the assay 
employed. A restriction enzyme that cuts DNA at a recognizable 6 

2 0 basepair restriction site will cut on the average of once every 

(4) 6 bp or 4096bp or 4. 1Kb. Very useful plasmid vectors on the 
order of 3 kb long are available with multiple cloning sites 
that are not found elsewhere in the vector's genome. Cloning 
sites typically have restriction fragment recognition sites of 6 
25 basepairs or longer. Multiple cloning sites in selectable gene 
regions also typically have recognition sites of 6 bp or longer. 

Small fragments are cloned first and then ligated to other 
fragments and cloned into another vector. As cloning of multiple 
fragments continues, the total fragment length becomes longer so 

3 0 that the vector category required to accommodate the insert may 

change. Plasmids can typically accommodate up to 10 kb of 
cloned DNA fragment, selected phage up to 2 5 kb of cloned 
insert, and cosmids up to 45 kb of inserted DNA. 

The plasmid vector of choice will have a multiple cloning 
35 site and a readily selectable locus that when disrupted by a DNA 
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insert can be distinguished from the phenotype when the vector 
is intact. One useful multiple cloning site and primer binding 
region in pUC18 and pUC19 has the same multiple cloning site in 
opposite directions in the center of the alpha-pep tide of the 
beta-galactosidase gene. Disruption of this site produces 
colorless rather than blue colonies on medium containing 
ampicillin and X-gal . Therefor, recombinant clones can be 
distinguished readily from nonrecombinant clones. 

A restriction enzyme site is then selected that is not 
found in the cloning vector or in the multiple cloning site. 
For instance, in the case of this illustration with pUC18 or 
pUC19, NotI (restriction site GCGGCCGC) is anticipated to cut 
once every 4 8bp = 65, 53 6bp. 

Then PCR primers are selected with restriction enzyme site 
sequences flanking the gene amplification sequences. For 
instance, if Hindi 1 1 and SphI at locations 398 and 404 in the 
pUC18 vector are restriction sites b and c in Fig, 5, then the 
PCR primers are synthesized with the Hindlll sequence flanking 
the forward PCR primer for fragment 1 and the NotI sequence 
flanking the reverse PCR primer for fragment 1 . These primers 
will be used to PCR amplify fragment 1 • f rom total genomic DNA . 
At the same time, the forward amplification primer for fragment 
2 also has the NotI sequence at the 5 ' end and the fragment 2 
reverse primer an SphI sequence at the 5 7 end. After PCR 
amplification of a unique gene target from total genomic DNA, 
the two fragments are then digested with NotI, Hindlll, and 
SphI, and unincorporated primers, nucleotides, and restriction 
endonucleases are removed from both amplified fragments. After 
removal, the digested fragments 1 and 2 are ligated to each 
other at the NotI site and ligated into a pUC18 vector digested 
with Hindlll and SphI. Recombinant clones are screened until 
colonies are found with fragments 1 and 2 ligated together. 
Then, one of these colonies can be grown up and the inserted 
fragment excised for subsequent cloning. 
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In an analogous fashion , fragments 3 and 4 are amplified 
from total genomic DNA with restriction site sequences SphI and 
NotI on fragment 3 and PstI and NotI on fragment 4 and cloned 
into another pUC18 vector at the SphI and PstI sites. This 
5 clone is then grown up, the insert excised and purified, and 
then fragment 1-2 is ligated at the SphI (V) restriction 
enzyme site to fragment 3-4 (Fig. 5, line 2. After ligation to 
produce fragment 1-2-3-4, this fragment is subcloned into 
another pUC18 vector digested with Hindi I I and PstI (Fig. 5, 
10 line 3) . 

The same strategy is used to synthesize fragment 5-6-7-8 
(Fig. 5, Lines 1,2,3). Then fragment 1-2-3-4 is ligated to 5-6- 
7-8 and subcloned into a Hindlll (b) and Xbal(f) digested pUC18 
vector. The same strategy is repeated until fragments 1-16 are 
15 subcloned into a single pUC18 vector at the Hindlll and EcoRI 
sites. ( As the PstI and Sail dual restriction enzyme digests 
work at very low efficiencies together, the Sail site will not 
be used.) Together these 16 fragments will yield a cloned 
insert about 4.8 kb long (300 bp X 16 fragments). 

2 0 The 16 fragments that are cloned are selected so that none 

has an EcoRI or Hindlll restriction enzyme sit. Then, this 
vector, with the 16 fragments, can be grown up and the 16 
fragment insert excised intact for further cloning. Since the 
43 fragments that average 3 00 basepairs are anticipated to span 
25 about 12,900 basepairs when placed end to end, this restriction 
site can be selected because it is not expected to occur within 
any of the 43 fragment sequences constructed. This enzyme 
selection would be designated W A" in Fig. 5. Then the 43 
amplified heterozygous fragments with two fragments in each PCR 

3 0 amplified product would be organized into groups of up to 16 

fragments with one or two PCR fragments for every amplified 
mutation locus. This would give six groups containing 86 PCR 
amplified fragments . 

Additional control sequences can be added to the original 
3 5 plasmid by splicing in other prepared control sequences. 
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Cloning multiple abnormal gene sequences into the same vector to 
be used as controls for multiple mutation tests merely requires 
that each abnormal sequence be present with sufficient flanking 
sequences to accommodate the requirements of the test design and 
5 test method employed including PCR amplification followed by 
restriction enzyme analysis, ASO, time-of -flight mass 
spectrometry, or direct or quantitative fluorescence analysis of 
unique fragments by multicolor fluorescence analysis. 

Even though the order of the cloned gene fragments is 

10 generally unimportant for synthesized DNA sequence controls, 
multiple fragments can be cloned and ordered as desired within 
single vectors with multiple cloning sites for ease in 
construction. For instance, pUC18 and pUC19 plasmids have a 
multiple cloning site with multiple restriction enzyme sites 

15 listed in order: (b)Hindlll, (c)Sphl, Sse8387, (d)Pstl, Hindi, 
(e)Xbal, (f)BamHI, (g)Smal, (h)Kpnl, (i)Sstl, and (j)EcoRI. If 
the unidirectional cloning site designated by vertical bars in 
Fig. ID is recognized by a restriction enzyme with an 8 basepair 
specificity like NotI, which cuts GC/GGCCGC sequences (Enzyme 

20 "a" in Fig. 5), then fragment 1 can be spliced with fragment 2, 
segment 3 with fragment 4, segment 5 with segment 6, etc. to 
convert the 16 different fragments (Fig. 5, Line 1) into 8 
spliced fragments in 8 different reaction tubes (Fig. 5, Line 
2) . These 8 fragments can each be subcloned into pUC18 at the 

2 5 multiple cloning site digested with two enzymes along with the 

fragment spliced at restriction site vx a." Thus fragment 1,2 is 
cloned into the b-c (Hindlll-SphI ) cloning site and fragment 3-4 
is cloned into the c-d ( Sphl-PstI) cloning site. These are grown 
in sufficient quantity to confirm the cloned fragments are the 

3 0 correct sequences in the predetermined order in the multiple 

cloning site. Then, these cloned segments are excised from the 
vector by digestion with the two enzymes sites used to clone the 
fragment into the vector, the insert from the vector is 
purified, and fragment 1-2 is ligated to fragment 3-4 at the 
35 c(Sphl) restriction site (Fig. 5, Line 3). At the same time 
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fragments 5-6-7-8, 9-10-11-12, and 13-14-15-16 are also ligated 
by an analogous strategy. Then, pUC18 is digested with enzymes 
b(Hindlll) and d(Pstl), purified, and ligated to fragment 1-2-3- 
4 at these two sites. This strategy produces a pUC18 vector 
5 that has a cloned 1-2-3-4 fragment. Again, the remaining 
fragments 5-6-7-8, 9-10-11-12, and 13-14-15-16 are ligated to a 
pUC18 vector digested with the two appropriate enzymes . These 
ligated fragments are cloned into bacteria and grown up as 
before to produce sufficient quantities of vector plus insert 

10 for the subsequent cloning step. Cloned fragment 1-2-3-4 will 
be excised from its vector using restriction enzymes b and d, 
separated from the vector, and ligated to excised fragment 5-6- 
7-8 from another clone (Fig. 5, lines 3-4) . By repeating this 
strategy as drawn, 16 fragments can be ligated together into a 

15 single fragment flanked by b (Hindlll ) - j (EcoRI ) restriction 
enzyme sites that can be used to cut this fragment only at the 
ends and the pUC18 vector at the multiple cloning site only and 
clone the two together. The only prior planning required is to 
determine that any fragment for which a restriction enzyme is 

2 0 subsequently used to splice that fragment from the vector does 

not contain that restriction site. For instance, synthesized 
fragment 1 cannot have b, c, d, f, or j restriction sites within 
its sequence. Otherwise, subsequent digestion with any of these 
restriction enzymes used to construct the long contig would also 
25 cut the fragment into smaller units. At the same time, fragment 
12 should not be cut by restriction enzymes h, g, f, and j 
anywhere but at the end sites digested for subsequent cloning. 
This requirement is not anticipated to interfere with cloning 
because all of the restriction enzyme sites in the multiple 

3 0 cloning site recognize 6 or 8 basepairs . A restriction enzyme 

that recognizes a 6 basepair sequence is anticipated to cut on 
the average every (4) 6 or 1024 basepairs. The average cloned 
fragment with cystic fibrosis mutations is 30 0 basepairs. 
Therefore an enzyme that recognizes a 6 basepair site is 
3 5 anticipated to cut only once in about 1 of 13 synthesized 
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fragments . For a fragment that is cut by restriction enzyme b or 
c, another restriction enzyme cascade like f, g, h, and j are 
anticipated to be appropriate substitutes for cloning into a 
final contig of fragments 1-16. To date, 43 fragments have been 
5 synthesized with cystic fibrosis mutations. These are 

anticipated to be cloned readily into three different plasmid 
vectors. The order in which the restriction sites are used can 
be modified to accommodate all fragment sequences. Furthermore, 
other vectors like pBR3 22 can be substituted for pUC18 as 

10 required because this vector also has other cloning sites 
including 7 cloning sites found within the tetracycline 
resistance gene and several other sites flanking this gene. 
Upon growing these plasmid vectors with inserts to sufficient 
quantity for routine analysis as controls, these could be 

15 digested with enzyme selected because it does not cut within the 
16 cloned fragment sequences. This would provide linearized 
cloned fragments with multiple • cloned fragments ready for 
analysis . 

The strategy used to ligate and clone 16 fragments would 

20 give a total mutation-carrying fragment of about 4.8 kb if each 
fragment were approximately 3 00 bp. In order to splice together 
2 or more of these fragments, different restriction sites can be 
used within pUC18 or in other plasmid vectors like pBR322 that 
hold up to 10 kb readily. Two different synthesized fragments 

25 with 16 contiguous control sequences could be excised at two 
different enzyme sites like Hindlll and EcoRI, spliced together 
at the Hindlll site, and cloned into another plasmid vector at 
the EcoRI site because these vectors are reported to hold up to 
10 kb readily. This description should not be construed to 

3 0 exclude using other vectors with cloning sites that modify 
selectable gene markers that can be used just as effectively as 
pUC18 or pBR322. Furthermore, a multiple cloning site with rare 
restriction enzyme cutting sites may be synthesized and used 
with much less regard for the sequences in each synthesized 

3 5 control sequence. 
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The next step requires ligating two fragments and cloning 
these into a plasmid, or phage. Then two or three different 
phage inserts can be introduced into cosmids . Judicious 
selection of restriction enzyme cloning sites can accomplish 
5 this goal. For instance, plasmid vectors can be induced to 
accommodate up to 25 kb but at a much lower efficiency like the 
25 kb cloned McArdle gene sequence used for gene mapping (Lebo 
et al., 1984) . Phage vectors which hold up to 22 kb, and cosmid 
vectors hold about 4 0 kb. The protocols for growing each of 

10 these vectors differs (Maniatis, 1982; Sambrook et al . , 1989). 
Each has been well characterized and has been used readily for 
cloning with high efficiencies, even starting with less than 5 
micrograms DNA extracted from flow sorted human chromosomes 
(Lebo et al . , 1986). Then, large quantities of the cloned 

15 insert can be grown, either as suspension bacterial cultures 
(plasmids and cosmids) or plaques (phage) for preparation of 
large quantities of insert for use either as controls or for 
ligating to other cloned control fragments into a vector that 
accommodates even larger inserts like BACs or YACS . 

20 

USE 

Obtaining these allelic sequences in total DNAs from 
individuals in the population is seldom possible without prior 
extensive testing of many individuals. In the end, screening may 

25 still fail to identify all of the different control alleles. 
Nevertheless, agencies regulating molecular genetic laboratories 
have required that controls be included within each genetic 
test. In contrast to prior art control protocols, for which 
each cell line must be grown and the DNA extracted prior to use 

3 0 in multiple assays in order to test all available controls, 
compositions of positive control sequences according to the 
invention are easy to prepare and afford multiple advantages. 
For example cloning and propagating all DNA control sequences 
synthesized removes the concern for adding PCR amplified 

3 5 products at the beginning of a PCR assay where the smallest 
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aerosol droplet that finds its way into an assay tube with an 
sample of unknown genotype will change the test result. 
Additionally, the overall experimental design of a genetic test 
is simplified substantially by using controls with multiple 
5 allelic sequences according to this invention because multiple 
controls either minimize the number of control assays required 
in multiplex tests or simplify control sample addition to 
multiple tubes . 

PCR is an extremely sensitive and rapid means to analyze 

10 DNA and is used whenever possible to enrich the sequence to be 
analyzed about 1,000,000-fold in standard analyses and about 
1,00 0,000,000-fold in single cell analyses so the target 
sequences can be assayed more readily. Thus, PCR is used for 
about 80% of the molecular genetic analyses in diagnostic 

15 laboratories and is applied routinely in forensic and identity 
testing. Since many tests are based upon PCR amplification, 
total genomic DNA controls have been preferred or required by 
regulatory agencies in the past over PCR amplified controls 
because PCR amplified controls have so many amplified copies 

2 0 that a single droplet of aerosol that travels from the PCR 
amplified DNA into another tube with an untested sample can 
change the result of the sample test. However, the Molecular 
Genetics Committee of the College of American Pathology has 
agreed unanimously to allow the use of artificially synthesized 

2 5 controls prepared according the method of the invention. Any 

mutant or polymorphic allelic sequence that is required as a 
control to conduct a robust DNA analysis can be synthesized and 
subsequently cloned whenever desired according to the methods of 
the invention. Thus, synthesized controls can be used to test 
30 any genetically transmitted nucleic acid sequence, e.g., from 
the nucleus or from an organelle, from any organism. 

Following the methods taught herein, PCR can be used to 
synthesize many copies of any desired abnormal sequence from 
well selected short primers that are synthesized artificially 

3 5 and used to PCR amplify short abnormal sequences into longer 

33 



WO 03/078570 



PCT/US02/21506 



normal gene sequences found in total genomic DNA in many- 
individuals of the species being tested. These PCR amplified 
products can be added later in any assay as a control . At the 
same time, these products can be cloned together into a single 
5 DNA fragment with multiple segments that can be propagated 
biologically in a single cell organism like bacteria or yeast, 
isolated, and used at the same target DNA sequence number per 
unit volume. Whenever desired, total genomic DNA from an 
unrelated species can be added to the control DNAs so that the 

10 total number of nucleotides per unit volume is identical to the 
number of nucleotides in the uncharacterized genomic DNA samples 
being tested. With one or both of these modifications, the 
multiple control sample can be added at the beginning of any 
assay, including a PCR amplification assay, at the same time as 

15 a total genomic DNA sample with an unknown genotype is being 
tested. 

Additionally, the overall experimental design of a genetic 
test is simplified substantially by using controls with multiple 
allelic sequences according to this invention because multiple 
2 0 controls either minimize the number of control assays required 
in multiplex tests or simplify control sample addition to 
multiple tubes. For example, some currently available formats 
for the 25 mutation cystic fibrosis test analyze all 25 
locations in genomic DNA simultaneously (Roche) . A control that 

2 5 has all 2 5 mutation sites in the same DNA fragment constructed 

according to this invention can be used once in each multiplex 
assay to determine simultaneously whether the assay is robust at 
all 25 sites, saving substantial cost and analysis time. In 
other formats, the assay requires individual tests for each 

3 0 mutant or polymorphic locus. A control according to the 

invention as above can still be used, i.e., the same control can 
be added with each site tested so that assay design and analysis 
is simplified. 
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While the present invention has been described in 
conjunction with a preferred embodiment, one of ordinary skill, 
after reading the foregoing specification, will be able to 
effect various changes, substitutions of equivalents, and other 
5 alterations to the compositions and methods set forth herein. 
It is therefore intended that the protection granted by Letters 
Patent hereon be limited only by the definitions contained in 
the appended claims and equivalents thereof. 
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CLAIMS 

What is claimed is: 

1. A method of carrying out a genetic test on a subject or 
subjects, said method comprising: 

selecting one or more inherited nucleic acid loci of said 
subject or subjects to be examined in said test; 

identifying multiple mutations or polymorphisms associated 
with said one or more loci; 

conducting said test; and 

using as a positive control in said test one or more 
nucleic acid fragments comprising three or more of said 
mutations or polymorphisms associated with said one or more 
loci . 

2. The method of claim 1, wherein one or more of said 
fragments comprises two or more of said mutations or 
polymorphisms associated with said one or more loci . 

3 . The method of claim 1 , wherein one or more of said 
fragments comprises only one of said mutations or polymorphisms 
associated with said one or more loci. 

4. The method of claim 1, wherein said genetic test is 
selected from the group consisting of individual or organism 
identification, pedigree relationship determination, genetic 
disease identification, population screening, classification and 
infectious disease identification. 

5. The method of claim 4, wherein said genetic test is to 
detect cystic fibrosis, Rett syndrome or Huntington Disease. 
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6. The method of claim 4, wherein said genetic test includes 
multiple gene locations, any one of which, when mutated, results 
in the same genetic disease phenotype . 

7. The method of claim 4, wherein said genetic test is for 
groups of genetic diseases commonly found in the same ethnic 
population. 

8. The method of claim 1, wherein said subject or subjects is 
a virus or a single cellular organism. 

9. The method of claim 1, wherein said subject or subjects is 
a multicellular organism. 

10. The method of claim 9, wherein said nucleic acid is from a 
chromosome or an organelle of said subject or subjects. 

11. The method of claim 10, wherein said organelle is a 
mitochondrion . 

12. The method of claim 9, wherein said multicellular organism 
is a vertebrate. 

13. The method of claim 12, wherein said vertebrate is a bird 
or a mammal . 

14. The method of claim 13, wherein said vertebrate is a human 
or an animal or a plant domesticated by humans. 

15. A method of preparing a positive control for a genetic 
test, wherein said positive control is a nucleic acid fragment 
comprising two or more mutations associated with said test, said 
method comprising: 
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selecting two PCR primer sequences, with one or bot! 
primers comprising one or more of said mutant or polymorphii 
sequences; and 

carrying out either a nucleic acid amplification reactioi 
5 using said primers so as to amplify the nucleic acid sequence 
between said primer sites of an individual of the species to be 
tested, or site directed mutagenesis using said primers and DNi 
polymerase followed by cloning of the resulting sequence. 

10 16. The method of claim 15, wherein said nucleic acid sequence 
is from total genomic DNA of said individual. 

17. The method of claim 16, wherein said nucleic acid sequence 
is DNA. 

15 

18. The method of claim 16, wherein said nucleic acid sequence 
is RNA. 

19. The method of claim 15, wherein said nucleic acid sequence 
2 0 is from an organelle of said individual. 

20. The method of claim 19, wherein said organelle is a 
mitochondrion . 

2 5 21. The method of claim 15, wherein said nucleic acid sequence 

is a clone from total genomic DNA of said individual. 

22. The method of claim 15, wherein said nucleic acid sequence 
is synthesized. 

30 

23. A composition comprising a panel of positive controls for 
use in carrying out a genetic test on a subject or subjects, 
said panel comprising: 

one or more nucleic acid fragments comprising three or more 

3 5 mutations or polymorphisms from multiple mutations or 
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polymorphisms associated with one or more inherited nucleic acid 
loci of said subject or subjects to be examined in said test. 

24. The composition of claim 23, wherein one or more of said 
5 fragments comprises two or more of said mutations or 

polymorphisms associated with said one or more loci. 

25. The composition of claim 23, wherein one or more of said 
fragments comprises only one of said mutations or polymorphisms 

10 associated with said one or more loci. 

26. The composition of claim 23, wherein two or more of said 
fragments comprise five or more of said mutations or 
polymorphisms associated with said one or more loci. 

15 

27. The composition of claim 23, wherein each of said fragments 
comprises only one of said mutations or polymorphisms associated 
with said one or more loci . 

20 28. The composition of claim 23 comprising more than 40 
fragments . 

29. A kit for carrying out a genetic test on a subject or 
subjects, said kit comprising: 

25 the composition according to claim 23; and 

amplification reagents. 

30. The kit of claim 29, further comprising a thermostable 
polymerase, restriction enzymes and instructions for conducting 

3 0 said genetic test. 
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